The paramagnetic and glass transitions in sudoku 
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We study the statistical mechanics of a model glassy system based on a familiar and popular 
mathematical puzzle. Sudoku puzzles provide a very rare example of a class of frustrated systems 
with a unique groundstate without symmetry. Here, the puzzle is recast as thermodynamic system 
where the number of violated rules defines the energy. We use Monte Carlo simulation to show that 
the "Sudoku Hamiltonian" exhibits two transitions as a function of temperature, a paramagnetic 
and a glass transition. Of these, the intermediate condensed phase is the only one which visits the 
ground state (i.e. it solves the puzzle, though this is not the purpose of the study). Both transitions 
are associated with an entropy change, paramagnetism measured from the dynamics of the Monte 
Carlo run, showing a peak in specific heat, while the residual glass entropy is determined by finding 
multiple instances of the glass by repeated annealing. There are relatively few such simple models 
for frustrated or glassy systems which exhibit both ordering and glass transitions, sudoku puzzles 
are unique for the ease with which they can be obtained with the proof of the existence of a unique 
ground state via the satisfiability of all constraints. Simulations suggest that in the glass phase 
there is an increase in information entropy with lowering temperature. In fact, we have shown that 
sudoku have the type of rugged energy landscape with multiple minima which typifies glasses in 
many physical systems, and this puzzling result is a manifestation of the paradox of the residual glass 
entropy. These readily-available puzzles can now be used as solvable model Hamiltonian systems 
for studying the glass transition. 

PACS numbers: 



Complex systems such as glasses are typified by frus- 
trated interactions which cannot be simultaneously sat- 
isfied. By contrast, systems with a well defined ground 
state are typified by some translational symmetry and 
long ranged ordering. The popular mathematical puzzle 
sudoku lies intermediate between these two. Standard 
sudokus, printed by the thousand in newspapers around 
the world, have a unique solution which normally has no 
symmetry. Adding one "wrong" number-constraint tips 
the problem into the regime of the frustrated system, 
which cannot be solved, while removing one of the clues 
leads to a system with multiple solutions. 

We obtained our sudoku puzzles from the online Playr 
siteQ, and also investigate a minimal 17-clue puzzle [2[ 
Overconstrained and underconstrained puzzles were cre- 
ated by adding or removing constraints from these puz- 
zles. For human solution, the puzzle should be solved by 
"logic" , and puzzles are categorized by how difficult it is 
to do this. The Playr site rates puzzles on a five-point 
scale. 

The sudoku problem can be cast into a Hamiltonian 
form. Here a 9x9 grid contains "spins", taking integer 
values between 1 and 9. The sudoku rules require that no 
"spins" in the same row, column or 3x3 sub-grid should 
take the same value. By assigning an energy to each vio- 
lation of the rules, we arrive at the following Hamiltonian 



assigning fixed values to some spins ( "clues" ) , and treat- 
ing the others as dynamic variables. The solution to the 
sudoku problem is the ground state of the Hamiltonian, 
which has H — 0. It can be seen that this is equivalent 
to a 9-state Potts Hamiltonian on a 20-neighbour finite 
periodic lattice. [1] 

Here, we define microstates of the system with each ct^s 
taking a value between 1 and 9. We then sample the mi- 
crostates according to the Boltzmann distribution with a 
fictitious temperature, t[3|, i.e. the probability of a state 
with energy H is p{H) ~ exp(— 7J/T). This is done by 
accepting or rejecting trial changes of individual numbers 
using the Metropolis algorithm[5|. This Markov process 
could be set to terminate once H = 0, acting as a solver, 
however better methods are available for solving sudoku 
puzzles and this is not the purpose of this work. We 
are interested in the equilibrium macrostate and glassy 
dynamics so we will use the Monte Carlo simulation to 
obtain ensemble averages. 

The mean energy < H > was evaluated, and the spe- 
cific heat 



=< [H- <H>f > /T'^ 



(2) 



(1) 



Where the j sum runs over all sites which share a row, 
column, or sub-grid with i. The problem is defined by 



for representative problems are shown in figure [T] This 
shows a transition between a low temperature "ordered" 
state close to the ground state, and a high temperature 
"paramagnetic" state with many rule violations. The 
variance of the energy shows a distinct peak at this tem- 
perature. These are the classic signals of a phase transi- 
tion, and hereinafter we will refer to it as such, however 
we note that since the system size is fixed there is no 
"thermodynamic limit" . For a system of this small finite 
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FIG. 1: Variation of "heat capacity" with temperature, show- 
ing that the transition temperature shifts with number of 
fixed values. Legend shows the Playr puzzle number, with 
the first digit labelling the "difficulty" and the number in 
brackets the number of fixed sites. 
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FIG. 2: Variation of critical temperature with number of clues 
for puzzles with and without unique solutions. 



size, we expect such a rounded peak rather than a dis- 
continuity. Larger sudoku puzzles with 4x4 and 5x5 grids 
exist, but the Potts-model sites have more neighbours, so 
do these not comprise a limit. The finite size of the phase 
space means that peaks are rounded rather than sharp, 
and consequently we were also unable to obtain sufficient 
data to calculate critical exponents. 

The transition temperature shows a strong dependence 
on the number of fixed sites, but not on the existence of 
a unique ground state solution (Figure [5]) . We investi- 
gated this by taking published puzzles, and either fixing 
additional sites (consistent with the solution) to increase 
that number, or removing fixed sites to create systems 
with multiple minima, or both. 

Two phases discovered here are akin to a high entropy 
paramagnetic phase, which samples the whole phase 
space, and an ordered condensed phase which is close to 
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FIG. 3: Variation of mean energy in the T < 0.2 range for a 
series of puzzles with different number of fixed sites, showing 
the appearance of a glassy state for intermediate numbers of 
clues, and an increased tendency towards glassiness for puz- 
zles with a unique solution 



the solution, while sampling other well-defined, energy 
minimum. The ground state, while unique, has no sym- 
metry, and there are also other macrostates correspond- 
ing to metastable energy minima in which the system can 
become stuck at low temperature. We refer to the phase 
stuck in this rough non-symmetric energy landscape as a 
glass. 

We studied a number of systems and monitored their 
ensemble average low T energy after an equilibration time 
of 200000 MC steps (Fig [3]). In general, this places us 
in a non-ergodic regime with each simulation becoming 
fixed in a local minimum. It is therefore a kinetic rather 
than thermodynamic measure, the non-zero values of < 
H > being due to failing to find the global minimum 
below the glass transition rather than thermal noise. For 
puzzles with multiple solutions the maximum complexity 
(as measured by mean low temperature energy) appears 
with about 27 fixed sites. With more fixed sites there 
is a smaller phase space, and the ground state is quickly 
found. With fewer clues, there is an increasing number of 
zero energy states, and again one is easily located. This 
may be regarded as a degenerate glass. 

This latter argument does not hold for puzzles with a 
unique solution, where equilibration from an initial ran- 
dom state typically falls into a local minimum. Indeed 
we see that the low-temperature ensemble-averaged en- 
ergy for systems with unique solution increases sharply 
towards 17, the minimum possible^. 

The existence of a unique ground state makes it possi- 
ble to be certain when the simulation is in a metastable 
state close to a non-zero-energy minimum. From this 
it is also possible to probe the roughness of the energy 
landscape and determine something analogous to a glass 
transition. At very low temperature the simulation falls 
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into a local minimum and becomes stuck. Then on heat- 
ing we enter the "glassy" regime where a variety of differ- 
ent states are sampled, including the ground state. This 
transition is associated with a broad peak in the specific 
heat on the Cy vs T plot, but a clearer measure of the 
number of local minima can be determined by measur- 
ing the fraction of time that the system is in the ground 
state (FigHJ. We find that there is a correlation be- 
tween both "difficulty" rating and number of fixed sites 
for the number of metastable states: harder puzzles with 
fewer clues spend less time in the global minimum, and 
therefore more in metastable minima. By contrast, the 
peak always falls at the same temperature, showing that 
the energy barriers between the various local minima are 
similar in all puzzles. 

The existence of a particular temperature which is op- 
timal for finding the ground state implies that both above 
and below this temperature, the entropy of the system 
should be higher. This is a more extreme case of the 
residual entropy of glass and disordered crystals 13, III 
in that the sudoku system appears to show a negative 
dS/dT. Such a result, if converted from statistical me- 
chanical entropy to thermodynamic entropy, would im- 
ply a violation of the second law. However there is no 
physical representation of this system, and no reversible 
process equating to our simulation, so the issue is mainly 
one of definition of entropy. 

The information (Shannon) measure is the "configura- 
tional entropy" of the system, defined as: 



k) lnp(CTij = k) 



(3) 



where p{(Jij = k) is the ensemble-averaged probability of 
finding the number k at site (i,j). In calculating this en- 
tropy, the process for evaluating the "ensemble average" 
becomes critical. S{T) is relatively featureless when cal- 
culated from a single long trajectory at each temperature, 
decaying monotonically to S=0 at T=0. When averages 
for Pi are taken over a set of separate simulations (i.e. 
repeatedly annealing to high-T) a distinct entropy min- 
imum is revealed at the glass transition, caused by the 
excess entropy from counting multiple instances of the 
glass at lower temperatures. The variation of S{T) with 
temperature calculated in this way is shown in Fi^ (in- 
set). It can be seen that this minimum corresponds to 
the peak in finding solutions, and is strongly correlated 
with the "difficulty" rating of the puzzle: harder puzzles 
have more local minima. 

For an abstract puzzle like sudoku there is no "phys- 
ically correct" way of evaluating the ensemble average. 
All we can say is that the ergodic hypothesis is clearly 
violated, and with it the link between statistical mechan- 
ics and thermodynamics. 

To summarise, we have shown that the sudoku puzzle 
can be used to define a model Hamiltonian system with 
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FIG. 4: Fraction of time spent in ground state for a series of 
puzzles with different "difficulty" rating, at each temperature, 
we simulated 10^ attempted steps, split into 1000 groups with 
each group initially randomized, and statistics gathered over 
the final 8 x 10^ steps. Legend shows playr puzzle number[l[. 
The solution to the 17 clue minimal puzzle was only found 
once. Inset: S{T) vs T. This quantity is normalized to the 
number of sites, rather than the number of free sites, hence 
systems with a large number of free sites tend to have higher 
S{T) throughout. 



interesting properties. It exhibits three phases, paramag- 
netic, condensed and glass The paramagnetic transition 
has a critical temperature which depends only on the 
fraction of fixed sites in the system. At low temperatures 
there is a notable difference between true sudoku puzzles 
with unique solutions, and the equivalent constrained 
Potts models with multiple solutions: the sudoku puz- 
zles are much stronger glass-formers. Finally, the sudoku 
Hamiltonian sheds light on the residual glass-entropy, by 
providing a model in which ergodicity breaking is mani- 
fest, leading to different results for the low-temperature 
entropy depending on the method of simulation. 
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